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Abstract 



Extending Furstenberg's ergodic theoretic proof for Szemeredi's theo- 
rem on arithmetic progressions, Furstenberg and Weiss (2003) proved the 
following qualitative result. For every d and k, there exists an integer A'^ 
such that no matter how we color the vertices of a complete binary tree 
Tn of depth A'^ with k colors, we can find a monochromatic replica of Td 
in Tn such that (1) all vertices at the same level in Td are mapped into 
(.~:> , vertices at the same level in Tn; (2) if a vertex x € V{Td) is mapped into 

CO ' a vertex y in Tn , then the two children of x are mapped into descendants 

lO ' of the the two children of y in Tn, respectively; and (3) the levels occupied 

r-~^ . by this replica form an arithmetic progression in {0, 1, . . . , A'^ — 1}. This 

(^ ' result and its density versions imply van der Waerden's and Szemeredi's 

theorems, and laid the foundations of a new Ramsey theory for trees. 

Using simple counting arguments and a randomized coloring algorithm 
called random split, we prove the following related result. Let A^ — N{d, k) 
denote the smallest positive integer such that no matter how we color the 
J^ ' vertices of a complete binary tree Tn of depth A'" with k colors, we can 

\—{ ' find a monochromatic replica of Td in Tn which satisfies properties (1) 

and (2) above. Then we have N{d,k) = O(dfclogfc). We also prove a 
density version of this result, which, combined with Szemeredi's theorem, 
provides a very short combinatorial proof of a quantitative version of the 
Furstenberg- Weiss theorem. 
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1 Introduction 

Van der Waerden's celebrated theorem |vdW27) states that for any positive 
integers d and fc, there exists an integer M = M{d, k) such that no matter how 
we color the elements of the set {1, 2, . . . , M} with k colors, at least one of the 
color classes contains an arithmetic progression of length d. 

Erdos and Turan |ErT36| conjectured in 1936 and Szemeredi [Sze75j proved 
in 1974 that this statement can be generalized as follows. For any positive inte- 
ger d and real S > 0, there exists an integer m — m(d, 6) such that every subset 
of the set {1,2,..., m} of size at least Sm contains an arithmetic progression of 
length d. Clearly, in van der Waerden's theorem, M{d, k) can be chosen to be 
m{d, 1/fc). 

A second proof of Szemeredi's theorem was given by Furstenberg f Fu77] . us- 
ing ergodic theory. Although qualitative in nature, this proof also has a quan- 
titative version |Ta06j . Furstenberg's proof represented a breakthrough, partly 
because of its flexibility. It led to a number of generalizations of Szemeredi's 
theorem that do not seem to follow by the original approach. These include the 
density Hales- Jewett theorem |FuK91j and the polynomial Szemeredi theorem 
|BeL96) . |BeL99) . 

In 2003, Furstenberg and Weiss |FuW03| extended Furstenberg's proof to 
recurrence properties for Markov processes, which resulted in a series of new 
Ramsey-type theorems for trees. To formulate their results, we need to introduce 
some definitions. 

For any positive integer d, let T^ denote the full binary tree of depth d — 1. 
We will use the terms of root, leaf, child, descendant, and level in their usual 
meaning. In the standard implementation, for any d > 0, the vertex set V{Tj^) 
of Tj^ consists of the strings of length smaller than d over the binary alphabet 
{0, 1}. The level of a vertex is the length of the string. The root, the only vertex 
at level 0, is the empty string. The leaves are the vertices at level d ~ 1. The 
children of a non-leaf vertex x are xO and xl. Finally, a; is a descendant of j/ if y 
is an initial segment of x. Any vertex is considered a descendant of itself. The 
empty tree will be denoted by Tq. 

We call a function / : ViT^i) — >■ V^(T„) a regular embedding of T^ in T„ if the 
following two conditions are satisfied. 

1. If y and z are the two children of x in Td, then f[y) and f{z) are descen- 
dants of distinct children of f{x) in r„. 

2. If X and y are vertices at the same level of Td, then f{x) and f{y) are also 
at the same level in T„. 

For any subset H C V{Tn), we say that H contains a replica of Td if there is a 
regular embedding / : V{Td) — J> H. If, in addition, there exist suitable integers 
a and b such that every vertex at level i in Td is mapped into a vertex at level 
a + ib in Tn, then H is said to contain an arithmetic replica of Td- See Figure 1 
for an example of a non- arithmetic replica of T3 in T^. 




Figure 1. A non- arithmetic replica of T^ in T5. 

Furstenberg and Weiss |FuW03j established the following theorem and var- 
ious density versions of it. 

Theorem A. |FuW03| For any positive integers d and k, there exists N = 
N{d, k) such that for every coloring of the vertices of Tjv with k colors, at least 
one of the color classes contains an arithmetic replica ofT^- 

Restricting this result to colorings of Tjv, in which all vertices at the same 
level receive the same color, we obtain van der Waerden's theorem. 

More than half of the vertices of T/v are leaves, yet the set of leaves of T/v 
contains no replica of T^. Therefore, to formulate an analogue of Szemeredi's 
theorem for trees, we have to measure the "density" of a subset H C V{Tn) 
differently. 

Furstenberg and Weiss defined the weight 'w{x) of a vertex x e V{Tn) to be 
2"'(^), where l{x) denotes the level of x in T„. The weight of a set H C F(T„) 
is 

w{H) = N^ w{x). 
xeH 

In other words, w{H) is the expected size of the intersection of H with a uni- 
formly selected random branch of T„ . 

Theorem B. jFuW03J For any positive integer d and real 6 > 0, there exists 
n — n{d, 5) such that every subset of the vertex set of Tn with weight at least 6n 
contains an arithmetic replica of Td- 

Obviously, Theorem B generalizes both Theorem A and Szemeredi's theorem 
on arithmetic progressions. 

The aim of this note is to offer a simple alternative approach to Theorems A 
and B. Using elementary combinatorial arguments and a randomized coloring 
algorithm, called random split, we prove the following results. 

Theorem 1. Let d, n be positive integers, and let H be a subset of the vertex 
set of Tn satisfying 



2-(ff) > J2 



d-i 

' n 






Then H contains a replica ofTj. 

Theorem 2. Let k,d,n>2 be integers. 

(i) Suppose that n > 5dklogk, where log denotes the logarithm of base 2. 
Then, for any coloring of the vertices of T„ with k colors, one can find in Tn a 
monochromatic replica of Td ■ 

(ii) If n < dk log k/6, then there exists a coloring of T„ with k colors such 
that Tn contains no monochromatic replica of Td- 

The first statement of Theorem 2 directly follows from Theorem 1. Indeed, 
for any fc-coloring of F(T„), the weight of at least one of the color classes is 
at least w{V {Tn)) / k = n/k. Thus, this color class contains a monochromatic 
replica of Td, whenever we have 



2n/fe > ^ 



d-1 

■ n 



1=0 ^ 

It follows by straightforward computation that this inequality holds for n > 
5dk\ogk. 

By its nature, the original ergodic proof of the Furstenberg- Weiss theorem 
is purely existential. We finish this section by showing that Theorem 1 implies 
a quantitative version of Theorem B with 

n{d,5) < 2^ 

Proof of Theorem B. Let H C V{Tn) be a set of weight at least Sn, and let 
I be a positive integer. We are going to prove that H contains an arithmetic 
replica of Tj, provided that n is sufficiently large. 

Let d = en, for some e > to be specified later. It follows from Chernoff's 
bound [ReTO] that 

d-l 

/ n \ 

<2'' 



V^ I "- \ ^ ■ph{e)n 
1=0 



where h{e) — — eloge — (1 — e)log(l — e) stands for the binary entropy of e. 
Therefore, as long as /i(e) < 5, we have 



d-l 



4=0 



and the condition in Theorem 1 is satisfied. Setting e = h^^{6), we obtain that 
Tn contains a replica of Td with d = [en] = [h^^{6)n\ . Let Hd C H denote the 
set of elements of such a replica. 

The set of levels occupied by the elements of Hd in r„ is a subset of 
{0, 1, . . . , n — 1} with density roughly h^^{6) > 0. Thus, it follows from Sze- 
meredi's theorem that this set contains an arithmetic progression of length I, 



provided that n is sufficiently large. This implies that there is a regular em- 
bedding / : V{Ti) -J> V{Hd) such that the levels of fiV{Ti)) in r„ form an 
arithmetic progression. In other words, T„ contains an arithmetic replica of Tj, 
as desired. If we plug in the best known quantitative version of Szemeredi's 
theorem, due to Gowers |Go01j . we obtain the desired bound. □ 

2 Proof of Theorem 1 

Let us start with a couple of definitions. 

The signature of a regular embedding / : V{Td) — > V{Tn) is defined as the 
set of levels in T„ occupied by the images of the vertices v E V{Td). Since 
all vertices at the same level of Td are mapped by / into vertices at the same 
level of Tn, and vertices at different levels in V{Td) are mapped into vertices 
at different levels, we obtain that the signature of / is a d-element subset of 
{0,l,...,n-l}. 

For a given subset H C Tn, We write S{H) for the set of signatures of all 
regular embeddings of Td in H, with d > 0. We have that G S{H), which 
corresponds to the degenerate case when d = 0, and To has no vertices. 

Lemma 3. Let H C 1/(T„). We have \S{H)\ > T"^"\ 

Proof. The proof is by induction on n. If n = 0, we have H = %, w(H) — 0, 
and S{H) = {0}, so the statement is true. 

Suppose now that n > 1 and that we have proved Lemma 3 for n — 1. Let 
r denote the root of T„, and let T' and T" be the two subtrees isomorphic to 
Tn-i that Tn — r splits into. We apply the lemma to these subtrees and to the 
sets H' = Hn V{T') and H" = H n V{T"). By the induction hypothesis, we 
have \SiH')\ > 2^'"'^"'\ Note that the weight of H' inside the tree T' is 2wiH'), 
because the levels are shifted by one. However, this shift does not affect the size 
of the set of signatures. Analogously, we have \S{H")\ > 2^™'^ \ 

We distinguish two cases. li r ^ H, then w{H) — w{H') + w{H") and 
S{H) = S{H') U S{H"). The inequality claimed in the lemma follows: 

\S{H)\ > max(|5(ff')l, \S{H")\) > 2'»'^'^(2»(^'),2»(h")) > 2"'(ff')+»(ff"). 

On the other hand, if r e 7f , then we have w{H) = w{H') + w{H") + 1, 
as 'w{r) = 1. In view of the fact that S{H) is the disjoint union of the sets 
S{H') U S{H") and {s U {0} | s 6 S{H') n s{h")}, we obtain that in this case 
\S{H)\ = \SiH')\ + \S{H")\. Using the convexity of the function 2"", we can 
conclude that in this case 

\S{H)\ = \S{H')\ + \S{H")\ > 2^""^"'^ + 22«'(^") > 2 • 2^(h')+Hh") ^ 2™(^), 

as desired. □ 

Proof of Theorem 1. By Lemma 3, the number of signatures determined by 
H is at least 2™'-^-'. By the assumption of Theorem 1, this quantity is larger 



than the number of signatures of size smaher than d. Therefore, S{H) has an 
element of size at least d. In other words, there exists a regular embedding of 
Td in H. a 

3 Random split and fit — Proof of Theorem 2(ii) 

To prove the existence of a coloring which meets the requirements in Theo- 
rem 2(ii), we use a random coloring algorithm which will be called random split. 
We color the vertices of T„ by the positive integers in order of increasing 
level (breadth first). While performing the coloring procedure, we maintain a 
list of "forbidden colors" for each vertex of T„ not yet colored. These lists are 
empty at the beginning of the procedure. When we reach a vertex x, we assign 
to X the smallest permitted color: the smallest positive integer c that does not 
appear on its list of forbidden colors. If x is not a leaf, we update the lists 
associated to its descendants as follows. Let y and z be the two children of x, 
and let Dy and D^ denote their sets of descendants. For each level I larger than 
the level of x, we make an independent uniform random choice and either add 
c to the list of forbidden colors of every element of Dy on level / or we add c to 
the list of every vertex in Dz on level I . 

Lemma 4. The random split coloring of r„ admits no monochromatic regular 
embedding (replica) of T2 . 

Proof. Consider any regular embedding / of T2 in T„. Denote by x the image 
of the root of T2 and let y and z be the images of the leaves. By definition, y 
and z are on the same level I and they are descendants of distinct children of x. 
In the random split coloring, x receives some color c and at the same time the 
c is added to the list of forbidden colors to all descendants of one of its children 
on level I. In particular, c will be forbidden cither for y or for z. Thus, / cannot 
be monochromatic with respect to this coloring. □ 

Lemma 5. Restricted to any one root-leaf branch of Tn, the random split 
coloring is equivalent to the following "random fit" procedure: We color the 
vertices one by one, starting at the root. For each vertex, we consider the positive 
integers in increasing order until one is accepted and give the vertex the accepted 
color. When considering the integer c, we accept it with probability 2"™, where 
m is the number of vertices (along this branch) that have earlier been colored 
with the color c. In particular, we do accept c the first time it is considered. 

Proof. Restricting our attention to a single branch simplifies the procedure of 
updating the lists of forbidden colors in random split: once a vertex is colored, its 
color is added to the list of each uncolored vertex independently with probability 
1/2. Equivalently, in the random fit procedure, if a color c appears m times along 
the branch up to a certain point, then for every remaining vertex y, the color c 
appears on the list of y with probability 1 — 2^™. These events are independent 
for different pairs {c,y), so deciding them can be postponed until the particular 
vertex y is colored, as done in random fit. □ 



The key to the proof of Theorem 2(n) is the following statement. 

Lemma 6. Let n > 8 and k ~ 2\3n/ logn\. For any branch of Tn of length 
n, the probability that the random fit algorithm uses a color higher than k is 
smaller than 2^""'. 

Before proving Theorem 2(ii) in its full generality, we show that Lemma 6 
implies the result for d — 2. Indeed, using the fact that r„ has 2"~^ branches, 
the probability that all of them will be colored by at most k colors is positive. In 
view of Lemmas 4 and 5, this means that the coloring obtained by the random 
split algorithm does not admit a monochromatic replica of T2, and it uses at 
most k colors with positive probability. 

Proof of Lemma 6. Fix a branch of length n of r„, and consider one by one 
the sequence of all choices made by the random fit algorithm. The maximum 
number of choices is iV := ("J ) , and after each choice we either accept or reject 
a color. Let pj denote the probability with which we accept the color at the 
j'th choice. 

Set Xq — 0, and for any j > 0, define the random variable Xj as follows. Let 
Xj = Xj_i + Pj if random fit rejects the corresponding color considered at the 
j'th choice, and let Xj — Xj_i + pj — 1 if it accepts. If random fit makes fewer 
than J individual choices, we simply set Xj ~ Xj^i. Obviously, the random 
variables Xj {j = 1,2,3, ...) define a martingale with differences bounded by 1, 
and Xj stabilizes for j > N. 

There are exactly n choices at which a color is accepted, and the correspond- 
ing — 1 terms contribute —n to Xn- If a color larger than k was ever used, then 
every color up to fc + 1 must have been used at least once. For simplicity, we set 
I = k/2 + 1 and use the fact that each color i < I must have been considered at 
least I times, and every time it was considered, it gave a positive contribution to 
Xn of at least 2"™*, where m^ is the total number of vertices along this branch 
that were assigned color i. Thus, we have 

^ ^ I P 9n5/3 

Xn > > :; n> —r-pT — n> ^ n, 

"^ 2"' - ni/3 - io„2 ^ 



1=1 



where the middle inequality comes from the fact the ^^ 
convex function. 

Azuma's inequality |A1S08| bounds the probability that Xjy = Xjy —Xo>T 
by e~2iv. Substituting T ~ ff"^ ~ "■> we obtain the desired bound for the 
probability that a color larger than k is assigned to some vertex. □ 

Proof of Theorem 2(ii). We have already seen that for d = 2 the statement 
directly follows from Lemma 6. This means that there is a fc-coloring x' of Tn' 
with n' = 0{k log fc), which does not admit a monochromatic regular embedding 

ofTs. 

To tackle the case d > 2, let n — [d ~ l)n' and split r„ into subtrees 
isomorphic to r„', in the usual way: the levels to n' — 1 form one subtree, 



the levels n' to 2n' — 1 form 2" subtrees, etc. Coloring each of these subtrees 
separately according to x' , we obtain a coloring that admits no monochromatic 
regular embedding of T^. □ 

4 Concluding remarks 

Furstenberg and Weiss generalized Theorem B in two directions. First of all, 
instead of binary trees, one can consider ternary trees or, in general, trees in 
which every non- leaf vertex has s children, for some s > 2. 

Obviously, our approach also applies to this case. Let Tn.s denote a full tree 
of depth n with this property. The only difference in our argument is that now 
the weight of a vertex x G V{Tn^s) at level I has to be defined as w{x) = s^K 
The weight of a subset of ViTn^s) is the sum of the weights of its elements. We 
can define the regular embeddings of Td^s in Tn^s as in the binary case, but now 
the s children of a vertex v G V{Td^s) have to be mapped to descendants of 
distinct children of the image of v. Instead of Lemma 3, now we have 

Lemma 3'. Let H C V{Tn^s)- Then the number of signatures of all regular 
embeddings of Td^s in H satisfies 

w(H) 

\S{H)\> J2 (^-1)"''^'> ■ 
(jes{H) 



Lemma 3' readily implies the following version of Theorem 1: 

Theorem 1'. Let d, n, and s be positive integers, and let H be a subset of the 
vertex set ofTn^s satisfying 



w(H) d- 



> 



E' 



(■) 



Then H contains a replica ofTd.s- 

Using Theorem 1' and Szemeredi's theorem, one can easily deduce the cor- 
responding version of Theorem B: Any subset H of the vertex set of T!„_s with 
weight w{H) > Sn contains an arithmetic replica of T^^s provided n > n^^d, s, S). 

As another variant of their result, Furstenberg and Weiss considered arith- 
metic embeddings in not necessarily full trees. Nevertheless, in what follows, 
we use the same definition of regular embeddings of Td in T as for embeddings 
in r„. 

Theorem B'. |FuW03J Let a be a positive real, let s > 1 be an integer, and T 
a rooted tree satisfying the following three conditions: 

(a) every vertex has at most s children; 

(b) every leaf is at level n; and 



(c) the number of leaves is at least 2"" . 
Then there is an arithmetic replica of Td in T provided n > nQ{s,d,a). 

Proof. Let us define the map g : V{Tn) — > V{T) as follows. For the root r 
of Tn, let g{r) be the root of T. For any non-leaf vertex v S V{Tci), let g map 
the two children of v to the two distinct children of g{v) which have the largest 
number of descendants that are leaves, unless g{v) has only one child. In the 
latter case, we map both children of v to the only child of g{v). Let 

H :^ {v e V{Td) I g{v) has or at least 2 children}. 

Since g preserves levels, it maps every replica of Td in H into a replica of Td in 
T, and every arithmetic replica is mapped into an arithmetic replica. 

Note that T has at most s'^(^)^^ leaves. Thus, by our assumption, w{H) > 
an/ logs. By Theorem B, if n > no(s,(i, a), then H contains an arithmetic 
replica X of Td- Consequently, g{X) is an arithmetic replica of Td in T. □ 
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